Visualising Typological Relationships: Plotting WALS with Heat Maps

نویسندگان

  • Richard Littauer
  • Rory Turnbull
  • Alexis Palmer
چکیده

We present a quantitative investigation of the cross-linguistic usage of some (relatively) newly minted derivational morphemes. In particular, we examine the lexical semantic content expressed by three suffixes originating in English: -gate, -geddon and -athon. Using data from newspapers, we look at the distribution and lexical semantic usage of these morphemes not only within English, but across several languages and also across time, with a time-depth of 20 years. The occurrence of these suffixes in available corpora are comparatively rare, however, by investigating huge amounts of data, we are able to arrive at interesting insights into the distribution, meaning and spread of the suffixes. Processing and understanding the huge amounts of data is accomplished via visualization methods that allow the presentation of an overall distributional picture, with further details and different types of perspectives available on demand.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Problems testing typological correlations with the online WALS

The ease with which WALS allows users to combine features from two maps and determine numbers of languages of the resulting types means that there is a danger of misusing the data from WALS to arrive at unsupported conclusions regarding typological correlations. I examine two instances where the overall numbers suggest a correlation and show that in only one of the two instances is there any re...

متن کامل

How Good are Typological Distances for Determining Genealogical Relationships among Languages?

The recent availability of typological databases such as World Atlas of Language Structures (WALS) has spurred investigations regarding their utility for language classification, the stability of typological features in genetic linguistics and typological universals across the language families of the world. Existing work on building NLP resources such as parallel corpora, treebanks for under-r...

متن کامل

seqPattern: an R package for visualisation of oligonucleotide sequence patterns and motif occurrences

4 Visualising oligonucleotide and consensus sequence densities 4 4.1 Preparing input sequences . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 4.2 Plotting dinucleotide density maps . . . . . . . . . . . . . . . . . . . . . . 6 4.3 Plotting average oligonucleotide profiles . . . . . . . . . . . . . . . . . . . . 10 4.4 Plotting consensus sequence density map . . . . . . . . . . . . . . ...

متن کامل

From Phonology to Syntax: Unsupervised Linguistic Typology at Different Levels with Language Embeddings

A core part of linguistic typology is the classification of languages according to linguistic properties, such as those detailed in the World Atlas of Language Structure (WALS). Doing this manually is prohibitively time-consuming, which is in part evidenced by the fact that only 100 out of over 7,000 languages spoken in the world are fully covered in WALS. We learn distributed language represen...

متن کامل

Can Gold " Cope " with Wals? Retrofitting an Ontology onto the World Atlas of Language Structures

0. Introduction The World Atlas of Language Structures (WALS, Haspelmath et al. 2005) is a large-scale “database of databases” consisting of 141 typological databases, covering a wide range of grammatical features, joined into one composite resource through the use of a common metadata scheme. While this metadata scheme ensures interoperability among databases across some dimensions (e.g., lang...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012